Survey on intermediate goals in AI governance
It seems that a key bottleneck for the field of longtermism-aligned AI governance is limited strategic clarity (see Muehlhauser, 2020, 2021). As one effort to increase strategic clarity, in October-November 2022, we sent a survey to 229 people we had reason to believe were knowledgeable about longtermist AI governance, receiving 107 responses. We asked about:
respondents' "theory of victory" for AI risk (which we defined as the main, high-level "plan" they'd propose for how humanity could plausibly manage the development and deployment of transformative AI such that we get long-lasting good outcomes),
how they'd feel about funding going to each of 53 potential "intermediate goals" for AI governance,[1]
what other intermediate goals they'd suggest,
how high they believe the risk of existential catastrophe from AI is, and
when they expect transformative AI (TAI) to be developed.
We hope the results will be useful to funders, policymakers, people at AI labs, researchers, field-builders, people orienting to longtermist AI governance, and perhaps other types of people. For example, the report could:
Broaden the range of options people can easily consider
Help people assess how much and in what way to focus on each potential "theory of victory", "intermediate goal", etc.
Target and improve further efforts to assess how much and in what way to focus on each potential theory of victory, intermediate goal, etc.
You can see a summary of the survey results here. Note that we expect readers to abide by the policy articulated in "About sharing information from this report" (for the reasons explained there).
Acknowledgments
This report is a project of Rethink Priorities, a think tank dedicated to informing decisions made by high-impact organizations and funders across various cause areas. The project was commissioned by Open Philanthropy. Full acknowledgements can be found in the linked "Introduction & summary" document.
If you are interested in RP's work, please visit our research database and subscribe to our newsletter.
[1] Here's the definition of "intermediate goal" that we stated in the survey itself:
By an intermediate goal, we mean any goal for reducing extreme AI risk that's more specific and directly actionable than a high-level goal like "reduce existential AI accident risk" but is less specific and directly actionable than a particular intervention. In another context (global health and development), examples of potential intermediate goals could include "develop better/cheaper malaria vaccines" and "improve literacy rates in Sub-Saharan Africa".
...and while I hopefully have your attention: My team is currently hiring for a Research Manager! If you might be interested in managing one or more researchers working on a diverse set of issues relevant to mitigating extreme risks from the development and deployment of AI, please check out the job ad!
The application form should take <2 hours. The deadline is the end of the day on March 21. The role is remote and we're able to hire in most countries.
People with a wide range of backgrounds could turn out to be the best fit for the role. As such, if you're interested, please don't rule yourself out because you think you're not qualified; at least read the job ad first!
Can I ask whether there is a specific reason that you did not put the summary of the findings in this post, but instead only let people request access to a Google Drive folder?
I just browsed through it; the reasons for not doing so are also described in a section of the report.
Yeah, the "About sharing information from this report" section attempts to explain this. Also, for what it's worth, I approved all access requests, generally within 24 hours.
That said, FYI I've now switched to the folder being viewable by anyone with the link, rather than requiring people to request access, though we still have the policies in "About sharing information from this report". (This switch was partly because my sense of the risks vs. benefits has changed, and partly because we apparently hit the maximum number of people who can be individually shared on a folder.)